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BACKGROUND OF THE INVENTION 
[0001] The present invention relates to systems and methods for correcting errors on a data 
storage medium, and more particularly, to systems and methods for direct partial updates of 
Reed-Solomon CRC/ECC check bytes. 

[0002] In data storage devices and systems such as, for example, hard disk drives (HDD), the 
combination of poor write/read conditions and low signal-to-noise ratio (SNR) data detection is 
likely to cause a mixed error mode of long bursts of errors and random errors in data sectors 
stored on the disk. Typically, byte-alphabet, Reed-Solomon (RS) codes are used to format the 
stored sector data bytes into codewords protected by redundant check bytes and used to locate 
and correct the byte errors in the codewords. Long codewords are more efficient for data 
protection against long bursts of errors as the redundant check byte overhead is averaged over a 
long data block. However, in data storage devices, long codewords cannot be used, unless a 
read-modify-write (RMW) process is used, because a logical unit data sector is 512 bytes long 
and computer host operating systems assume a 5 1 2 byte long sector logical unit. Each RMW 
process causes a loss of a revolution of the data storage medium. Losing revolutions of the data 
storage medium lowers input/output (I/O) command throughput. Therefore, frequent usage of 
the RMW process becomes prohibitive because it lowers I/O command throughput. 
[0003] Rather than uniformly adding check bytes to short codewords to correct more random 
errors in the short codewords, U.S. Patent No. 5,946,328, issued on August 31, 1999, invented by 
Cox et al and assigned to International Business Machines Corporation, discloses a method and 
means for generating check bytes that are not rigidly attached to a short codeword but are shared 
by several short codewords in an integrated interleaved Reed-Solomon (RS) Error Correction 
Coding (ECC) format. Interleaving is a commonly used technique by which the bytes in a data 
sector are split into several byte streams, each of which is encoded separately, and thus 
constituting a short Reed-Solomon codeword. A reason for interleaving is to split the errors in a 
sector among several codewords, thus avoiding the need to build in hardware a very complex 
Reed-Solomon decoder that can correct a very large number of errors. In the presence of random 
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errors, distinct interleaves maybe affected different,,, to the effect ft* a sector can fad on-the- 
fly (OTF) correction dne to an excess of a small number of random errors in one uuerleave. At 
low SNR's the probability of such sector failures increases due to an uneven distrrbution of 
random errors among .he interleaves. U.S. Paten.No. 5,946,328 addresses mis specific problem 
of sector failures due to random errors exceeding the OTF correction capability in one mterleave 
by using shared check bytes in an integrated interleaving two-level ECC format. 
,00041 in US. Paten. No. 5,946,328, fire method and means for generating shared check bytes 
for a two-level ECC among a pluralily of interleaves is implemented by performing byte-by-byte 
summation of all interleaves prior to encoding as we., as me repeated byte-by-byte summation of 
all resulting codewords obtained after encoding. This requires .ha. the interleaved data smogs be 
simultaneously available for byte-by-byte summation, as is .he case when the combmed 
interleaves constitute a single data sector. Each individua, interleave, as we,, as their sum are 
encoded by Reed-Solomon encoders where the in.er.eave sum codeword has more check bytes 
to each individua. in.er.eave codeword. Summation of tire codewords produces a summed 
, interieave codeword .ha. is equally protected against random errors as all the otter interleave 
codewords. The summed interleave codeword has less message bytes at .he expense of 
addition, potential check bytes to be used in any interleave codeword with an excess of random 
errors provided .hat .he remaining interleave codewords do no. have errors in excess of .he OTF 
ECC capability. 

0 ,0005, The combination of low SNR detection and poor write/read conditions may result in 
both random errors as well as long bursts of byte errors ( "mixed error mode") becoming more 
and more likely at high ureal densities and low flying heights, which is the trend m HDD 
industry The occurrence of such mixed error mode combinations of random as well as burs, 
errors is likely to cause the 512 byte sector interleaved OTF ECC to fail resulting in a more 

,5 frequent use of a data recovery procedure (DRP) which invo.ves rereads, moving the head ete 
These DRP operations resu,, in the loss of disk revo.utions that causes a lower input/output (I/O) 
throughput. This performance loss is not acceptable in many applications such as aud.o-v.sua. 
(AV) data transfer, for example, which will no. .olerate frequent interruptions of v.deo data 
streams On the other hand, uniform protection of a., single sectors against both random as well 

30 as burs, errors, a, the 512 byte logical unit sector format, would result in excessive and 
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unacceptable check byte overheads. Such check byte overheads atso increase the soft error rate 
due to the increase in linear density of the data. 

,0006) Long block data ECC, such as 4 K byle physical block comprising eight sectors, for 
example, could be a solution for some applications, but it would require a change m the 
5 operating system standard, unless read-modify-wri.e (RMW) is accepted when writing stngle 
5,2 byte sectors. Present operating systems are all based on a 5 12 byte long sector logtca, unrt. 
RMW is required to update the long physical block check bytes. Thus, when a single 512 byte 
sector is written, ,he other sectors in the long block need to be read, the long block check bytes 
need to be recalculated, and the who.e long Hock is then rewritten. Hence, the RMW causes an 
,0 I/O throughput performance loss ma. is generally unacceptable for typical HDD operatton. 
,00071 Therefore, it would be desirable to have an ECC format for a data storage device that 
has a low sector faUure rate for .he mixed e^or mode of random etror and burs, error, .ha. avotds 
frequent DRP or RMW use, and that also has an acceptable check byte overhead. 
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BRIEF SUMMARY OF THE INVENTION 
,0008] The present invention provides techniques for correcting data bytes on a data storage 
m edium. Data bytes are stored in .he da«a s.orage medmm in sectors. Each sector contains we,, 
known cychc redundancy check (CRC) bytes and error correchon coding (ECC, check bytes ma. 
are used for detecting and correcting data bytes in that sector. The CRC and ECC bytes are 
generated by a CRC/ECC generator. Subsets of .he sectors are grouped into blocks. Each block 
includes ECC bytes that are used for correcting data bytes in any of.be sectors in .ha. block. 
,00091 According to .he present invention, a da.a storage system generates a long block 
membership (LBM) byte for each sector. The LBM byte indicates whether the sector is par, of a 
block of sectors or not. The data storage system of .he presen. invention adds .he LBM 
25 contributions to the CRC and ECC bytes and men attempts to correct the failed sec.or. If the 
correction ptocess is successful, the data storage system decfares a miscotrectton. 
100X0, If the correction process is not successful, the data storage system again adds me LBM 
contributions to the CRC and ECC bytes and then attempts to co.ec. the failed sector. If .he 
second correction process is success*.,, .he data correction is accepted if the error pattern and the 
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, i *«n a threshold value. If the second correction process is not 
check byte overlap is less than a threshold v«uu 

successful, the data storage system declares a hard error. 

BRIEF DESCRIPTION OF THE DRAWINGS 

controller that includes an on-the-fly error correchon eode (ECC) system 
1 0 the-fly error correction code; 

channel and a write ehanne. of the hard disk drive controller of F.gure A; 

100141 ^m^.-*^*'*^"^*""^ 
of the present invention; 

15 l00151 ^ s «-..«-*«-»«— 

embodiment of the present invention; 

m * Figures 3A and 3B iUustrate a process for encoding LBM, LB A, and data bytes to 
ZteCRCand ECChytes aecordingto anemood,me„ t of the present mvenhou, and 

100171 n^^^^^^'^^^ 
20 embodiment of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

i n f » hard disk drive control system for reading 
,00.8, Figures 1 A and IB illustrate an example of a ha d d 

■ t. j^cV Thp hard disk drive system ol rigures in. 
and writing data onto a magneto hard d* The « * J ^ ^ 

discussed in further detail below. 
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|001 „ Figure IA an exempt architect of a buffered hard dtsh confro, « £ 

Ld L* conrroner 50 is configured ro read data from aud write data ,o a ma^ettc ard dtsh 4. 
Courier 50 includes an on-the-fly error correction code (ECC) system 100 for 
an on-the-fly error correction code. Ou-the-fly error correctiou code system 100 iuc.udes au 
ECC read processor 163 and an ECC write processor 167. 

,0020, Figure IB is ab.odc diagram of the hard disk controiler 50 of Figure ,A ft* inchtdes 

wnt ,e„ onto .he dish ,4, they are placed temporarily in a buffer ,65 shown ,n F.gure 1A and 
subsequently processed and transduced along a write path or channe. (167, 169, and 157). 

[002!, Tire hard drive conlroUcr 50 inc.udes a ,ogic drive circuit .05 shown in Figure IB that 
oJsda.fromhardd i sha S semb,y33,forexamp,e & omSbi K .o32bi,, A d lsk a Ss emb^3 

i. comprised of dish .4 and head stach aasembues. A FIFO register 1 .0 stores the formatted 
aata and exchanges the same with a sector buffer ,20. Tire ECC system ,00 recetves the 
formatted data from the drive logic circuit ,05 and performs an error correction eodtng 
a lg ori.hm. ECC system ,00 can a,so perform the techniques and processes of the present 
invention, which are discussed in detail below. 

,0022, A buffer manager 1,5 contro.s data traffic between the ECC system ,00, a sector buffer 
. , r Lorn access memory) ,20, and a microprocessor ,25. Another F.FO regts er ,30 stor, 
a, and exchanges the same with the sector buffer ,20. A sequence control,* ,35 , conne d 

the sequence operation of the data traffic and various commands across the hard dnve controHe 
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,0023. Firs, a predeterntined number of binary data Cements, also termed bytes, ,„ a da a 
string are moved from the buffer ,65 and streamed through an ECC write processor 
ECC write processor .67, the data bytes are mapped into codewords drawn from a suttaMe hnear 
bl „ch or cyclic code such as a Reed-Solomon code. Next, each codeword is mapped m a wrtte 
path signaling unit 169 into a ntn ,ength Hmited or outer bandpass or specfrahshaptng «* 
L Iged into a time-varytng signal The time-varying signa, is app.ied through an tn.erface 
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read/wri.e .reducer interface ,57 and .hen eo the write e.emen. in a magneto resistive (or other 
suitable transducer head) for conversion into magnetic flux patterns. 

,0024, All of the measures starting from the movement of me binary data elements from buffer 
165 unti. the magnetic flux patterns are written on a sdec.ed disk trade as .he rotating 
passes under the read/write head are synchros and streamed. For purposes of efficten, data 
transfer, the data is de-staged (written out) or staged (read) a disk sector at a tune. 
,0025) Thus, both the mapping of binary data into Reed-Solomon codewords and the 
conversion to flux producing time-varying signals must be done we,, within the time in.erva 
defining a unit of recording track length moving under the transduce, Typtca. untts of recording 
track length are equal fixed-length byte sectors of 512 bytes. 

,0026, When sequences of magnetic flux patterns are to be read from the disk , 4, they are 
processed in a readpath or chamte, (,57, ,59, ,6,, and ,63) and written into thebuffer ,65. The 
tim e-varyng signals sensed by a transducer are passed through the read/write transducer 
iuterface ,57 to a digita, signa, extraction unit ,59. Here, the signa, is detected and a dectsto ,s 
mad e as to whether it shou,d he resoWed as a binary , or 0. As these V. and 0's stream ou, of the 
signa, extraction unit ,59, they are arranged into codewords in the formatting urn. 16,. 
,0027, Because the read path is evahtating sequences of Reed-So.omon codewords previous,, 
recorded on ,he disk ,4, absent error or erasure, the codewords shouid be the same, fit order to 
.est whether that is the case, each codeword is app.ied to an ECC read processor ,63 over a path 
from a formatter 161. 

[0028, A,so, the output from the ECC processor .63 is written into buffer ,65. The read path 
a,so operates in a synchronous data-streaming manner such that any detected etrots must be 
.ocated and corrected within the codeword we,, in time for the ECC read processor 63 ,0 
receive the next codeword read from the disk track. The buffer 165 and the read and wrtte 
channels may be monitored and control.ed by the microprocessor ,25 to ensu re efficacy where 
patterns of referencing may dictate that a path no. be taken down, such as sequentta, read 
referencing. 
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,0029, As discussed above, da, by.es are stored in a data storage disk in sector, Subsets of 
L sirs are grouped into ,on g b.ocks. Each sector contains firs, .eve, CRC/ECC bytes, and 
each block includes second level ECC B bytes. 

,0030, A data storage systenr uses .be firs, ,eve, sec.or CRC and ECC bytes .o ioea.e and ,o 
correct errors in da.a by.es wi.bin a sector. Wben .here are .oo many enors in the da, by.es o 
be coated by .he firs. ,eve, sector ECC by.es, the da,a storage sys.cn, uses d. second ,eve, 
Mock ECC B bytes to correc. Ure errors. ECC B hy.es are used for correcting da«a bytes m any of 
the sectors ,n .ha. btock. Miscorrection of an error can happen when bo.b of .he firs, .eve, and 
the second .eve. CRC and ECC bytes are in error. 

,0031, Further details of multiple level, integrated sector forma., error correction code 
encoding and decoding processes for data storage or communication devices is discussed tn 
further detai, in U.S. Paten, App.iea.ion Pubhcanon US 2003/0,47,67 A, , pubhshed Angus, 7, 
2003, to Asano et a,., which is incorporated by reference herem. 

(0032, A CRC/ECC generator generates the ECC and CRC bytes for a sector based on the da, 
Ls wri„en into Una, sector. Subsequently, the data storage system can npdato the data bytes tn 
a sector with new data bytes. Tire da,a storage system can write new data bytes into a sector 
even if me sector was originafiy par. of a long Mock of sectors, bn, me new databytes are no. 
linked to other sectors in the long block. 

,0033, When the data bytes are changed in the updated sector, new first level CRC and ECC 
bvtes are a,so generated and stored in the updated sector so ,ha. ,hey can be nsed to ,oca.e and 
dec, .he new da,a by.es. When the origina, firs. ,evel CRC and ECC check bytes ™d 
the da, storage system can no ,onger de.em.ine whether ,he updatod sector s pari of a b,ock 
sectors. The likelihood of such an even, is es.ima.en to be between 10 and 10 . 

,0034] The present invention solves .his prob.em by filtering on, sectors ,ha, have been 
: up aa.ed wi,h new da, bytes. The presen, invention provides a long block membership (LBM) 
bvte for each sector. The LBM byte for a parttcmar sector encodes information ,ha« mdtcates 
Iber ma, sector was written as a par, of a block of sectors or not. The CRC/ECC generator 
calculates ,he CRC and ECC bytes for a sec.or by adding in an incremen,, conntbunon from 
LBM byte to indicate that die sector is part of a long block. 
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needs to be updated or checked. 



neeus iu u^uuw 

r \ a o^Ptnr k tested for actual membership 
level decoder is successful, a miscorrection is declared, turtne 
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are now discussed. 

100371 

contribution is added into the CRC and ECC bytes agatn, w 
i„ the CRC/ECC bytes. Because the LBM eontribution has been nui.ed out, 
15 belongs to a long block. 

this sector did no. be,on g to a UocK and therefore, the second ieve, ECC B bytes 
accurately correct the error. 

, 1 , Pre bvtes fail to correct the error, the system again adds the LBM 
on mniQl Tfthe second level ECCb bytes iaiiiuw 

1 , ,„ the CRC and ECC bytes and attempts a second time to correct the error. If the 

information into the CRC/bUU oyu» 
25 ^membership information. This confirms tire re,evance o 

which me LBM btock membership information prec,se,y makes the d.fference. 
determined by the trust level. 
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mm For example ifthe threshold level is 5 10-bit symbols, the system reliability is 

>o->. ------ of ;;::r;r 8 

it front ,«» to to ", or from lO" 5 " to .0*, whiehever assumption one wants to make. The 
threshold is adjustable. 

,0041, According to an alternative embodiment of the present invention, the LBM byte is not 
LTdLinelmeraseo.orisamemberofa.onsb^ofseo^.mmtsembo^ent, 

specified nnmber of CRC and firs, .eve. ECC bytes in a sector that has faded the firs, .eve 
I! on process The specified number can be adjnsted at any time. This embodtment does 

o„ ly the second .eve, correction in the restricted si.na.ions where .akmg ou, the b,oc 
Ilership canses the second .eve.decoder to fad that the other embodtment prov.des. 
,0042, Figure , illustrates the relationshtp between the bytes associated with a sector 20 O 

by.es are addresses for the da.aby.es. The LBA bytes are no. written onto the d,sk. 

,0043, Sector 200 also inc.udes firs, .eve, CRC by.es for cor detection and first level ECC 

S.«^^»ii*---'-• U «^ Tta,,, " W, ■ ,l " 



not written onto the disk. 



) 



:5 



11UI WllUVll 

,0044, F,gure 2 i„us,ra.es by.es associated with a block of sectors. Each ^ 
1 rNofsec,ors(e,.,Ssec.or S , - — ^ h ^f-""a£, 
as, secor in a block inchrdes shared long block error co.ec.ion code check bytes (ECC B > 
I:d,o„gb,„ckECCby,e S areused,oeo,ec,errorsi„da,aby,esma,canno,becorrec,edb y 

the first level ECC byte in each sector. 

,0045, The ca,cu,a,ion of CRC and ECC bytes are we,, known aigorithms. The systems of 
Fibres 3 A and 3B modify the well known CRC and ECC calculations to achieve the benefits of 
the present invention. 

,0046, Figure 3A illustrates how the LBM byte confines .o me CRC and ECC bytes 

, ApUa CRC and ECC bytes based on contributions from tire LBM, 
CRC/ECC generator generates delta CRC and tL^oy. 

LBA, and data bytes. The de.ta CRC and ECC by.ee are added .o the CRC and ECC of a secto 



....og^eECCandCRC^^asshownin^aB. The ECC and CRC bytes are 
used in the decoding process as discussed below. 
,047] Figure 4 illust„^ 

m agnet 1C d.sk drive system such as the controller shown in Figures 1 A-1B. 

,0048! initially the process of Figure 4 attempts to correct a data error in a sector using the 
100481 Imtl Y ' w , Sector at step 401 A determination is then made as to whether 
first level ECC bytes in the failed sector at step 4U l . a u 

nrsi icvci " f . f t { el error correction 

the first level error correction was successful at step 403. If the 

♦ Tft^fi«t level error correction was not 
process is successful, the process terminates. If the first level error 

successful, the process proceeds to step 405. 

,0049, At step 405, the data storage system adds the LBM contributions (deha CRC and ECC 
Aisiep*w, T7; a nre IB The data storage 

bv.es) to dre CRC and ECC bytes for the failed sector as shown in Figure 3B. 

according to a well-known syndrome generatmg process. The data storage y 

tben makes a determinatton as to whether the decoding process successfully corrected 
in the sector at step 411. 

,0050, ,f the enor was successful corrected, the data storage system dedares a —on 
Z 413 At step 41 3, the process of Figure 4 has determined that data bytes in the failed 

contain the LBM contribution before step 405, and therefore, is no, par. of a long Mock, 
5 trhasbeenupda.edoroverwrinenwi.hda.b.esma.areno.pariof.heonginallongbloc, 

the LBM contributions to the failed sector a second time at slep415 cancels 0 " 
eon.rihu.ions that were added .0 the CRC and ECC by.es in the failed sector a, step 405. 
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,00*, The da,a s,orage system then generates scenes for the second ,ev 
ECC B by.es a. s.ep 417 according .0 a well-known syndrome generafng process. The 
aorage system then decodes the second lev., ECC. bytes at step 4.9 using a decodtng 

system declares a hard error at step 423. 

,0053, If the error was snccessfuUy corrected, the data storage system determines whether the 

byt es can correct. If the error partem and ECC overlap is less man or ec.ua, to * teWd 
lemesystemdeclaresamiscon-ecti on a, step «7, because me trust .nresho,d h, not e. 
deeded f the error paUem and ECC overlap is greater than the threshold value, the system 

,0054, Thus, the present invention provides techniques for corrccttng data by.es on a data 
^medium. A sec.or .ha, has been updated or overwntten with new da.a by.es may no. 
II long bio* of sectors. The present invention de.ee, whether a failed sector • 

.he present invention does no. use corrected data generated from the second level ECC B bytes 
lella m that faded sector. The present invention only uses corrected data generated from 

part of an original long block of data sectors. 

,00551 Whi.emepresen.inventionhasbeendescribedhereinwi,hreferencetopa rtl cu,ar 

in the present invention, .n some instances, features of the tnvention can be employed wuh u, 
S eorres ending use of otiter features, without departing from the scope of the mventton as set 

closed, without departing from the essentia, scope and spin, of the present — « • 
intended that the tnvention no, be limited ,0 me particular embod,men, d,sc,oseti but ,h Uh 
mention wiU include a« embodiments and e q u,valen,s falling within me scope of me c,a,m, 
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